246 ◾ Bioinformatics
DREME is depreciated by the developer. So, it is recommended to use MEME or
STREME instead. Since we wish to find motifs in more than 50 FASTA sequences, we will
use STREME program. STREME will attempt to find motifs by iterating five steps until the
user-specified stopping criterion is met. The stopping criterion can be either the p-value of
the motif or the total number of motifs found. The five steps include building of a suffix
tree for both the ChIP-Seq FASTA sequences and control sequences, evaluation of the seed
words, motif refinement, motif significance computation, and motif erasing (converting
the site of the best motif to N in the primary sequences and control sequences).
The following STREME command searches for motifs in the FASTA sequences of the
three ChIP-Seq samples:
streme --p chip1_peaks.fasta \
--oc streme_motifs_chip1 \
--dna \
--thresh 0.05
streme --p chip2_peaks.fasta \
--oc streme_motifs_chip2 \
--dna \
--thresh 0.05
FIGURE 6.19 The motifs found by DREME.